LASSO+DEA for small and big wide data


Abstract

In data envelopment analysis (DEA), the curse of dimensionality problem may jeopardize the accuracy or even the relevance of results when there is a relatively large dimension of inputs and outputs, even for relatively large samples. Recently, a machine learning approach based on the least absolute shrinkage and selection operator (LASSO) for variable selection was combined with sign-constrained convex nonparametric least squares (SCNLS, a special case of DEA), dubbed LASSO-SCNLS, as a way to circumvent the curse of dimensionality problem. In this paper, we revisit this interesting approach by considering various data generating processes. We also explore a more advanced version of LASSO, the so-called elastic net (EN) approach, adapt it to DEA, and propose EN-DEA. Our Monte Carlo simulations provide additional and, to some extent, new evidence and conclusions. In particular, we find that none of the considered approaches clearly dominates the others. To circumvent the curse of dimensionality of DEA in the context of big wide data, we also propose a simplified two-step approach, which we call LASSO+DEA. We find that the proposed simplified approach could be more useful than the existing, more sophisticated approaches for reducing very large dimensions into sparser, more parsimonious DEA models that attain greater discriminatory power and suffer less from the curse of dimensionality.
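The abstract does not spell out how the variable-selection step is implemented, so the following is only a minimal sketch of what a LASSO selection step could look like, not the authors' procedure. It assumes a single output regressed linearly on centred candidate inputs and solves the LASSO problem by cyclic coordinate descent with soft-thresholding; the function names, the toy data, and the penalty value `lam` are all hypothetical. A second step would then run a standard DEA on the retained variables only, which is not shown here.

```python
def soft_threshold(z, t):
    """Soft-thresholding operator: sign(z) * max(|z| - t, 0)."""
    if z > t:
        return z - t
    if z < -t:
        return z + t
    return 0.0


def lasso_coordinate_descent(X, y, lam, n_iter=200):
    """Minimize (1/(2n)) * ||y - X b||^2 + lam * ||b||_1 by cyclic coordinate descent.

    X is a list of n rows with p columns, y a list of n responses.
    No intercept is fitted, so the data are assumed to be centred.
    """
    n, p = len(X), len(X[0])
    beta = [0.0] * p
    residual = list(y)  # equals y - X @ beta, and beta starts at zero
    col_sq = [sum(X[i][j] ** 2 for i in range(n)) / n for j in range(p)]
    for _ in range(n_iter):
        for j in range(p):
            if col_sq[j] == 0.0:
                continue  # an all-zero column carries no information
            # Covariance of column j with the partial residual (beta_j added back)
            rho = sum(X[i][j] * (residual[i] + X[i][j] * beta[j]) for i in range(n)) / n
            new_bj = soft_threshold(rho, lam) / col_sq[j]
            delta = new_bj - beta[j]
            if delta != 0.0:
                for i in range(n):
                    residual[i] -= X[i][j] * delta
                beta[j] = new_bj
    return beta


# Hypothetical toy data: three candidate inputs (orthogonal columns), one output.
# The output depends strongly on column 0 and only very weakly on column 2.
X = [
    [1.0, 1.0, 1.0],
    [-1.0, 1.0, -1.0],
    [1.0, -1.0, -1.0],
    [-1.0, -1.0, 1.0],
]
y = [2.05, -2.05, 1.95, -1.95]  # y = 2.0 * col0 + 0.05 * col2

beta = lasso_coordinate_descent(X, y, lam=0.1)
# Variables with a nonzero coefficient are the ones kept for the DEA step.
selected = [j for j, b in enumerate(beta) if b != 0.0]
print(beta, selected)
```

In this toy case the penalty shrinks the weak and the irrelevant coefficients exactly to zero, so only the first candidate input would be passed on to the efficiency analysis. In practice `lam` would typically be chosen by cross-validation or a similar criterion.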


Related resources

Small Texts for Big Data

Taking advantage of Big Data while retaining a user-centered point of view is quite difficult. Managing data volume, variety and velocity to extract the relevant information is still challenging. The information extraction needs customization to adapt both content and presentation to fit users’ current profile. Regarding the content, data volume can be reduced and personalized by using user pre...


An Architecture for Security and Protection of Big Data

The issue of online privacy and security is a challenging subject, as it concerns the privacy of data that are increasingly more accessible via the internet. In other words, people who intend to access the private information of other users can do so more efficiently over the internet. This study is an attempt to address the privacy issue of distributed big data in the context of cloud computin...


Big Data, Small World

Big data usually includes data sets with sizes beyond the ability of commonly used software tools to capture, curate, manage, and process data within a tolerable elapsed time. In this paper, we discuss the various characteristics of big data and how data is increasing day by day. Various aspects of knowledge discovery are also discussed. Moreover, the small world phenomen...


Tupleware: "Big" Data, Big Analytics, Small Clusters

There is a fundamental discrepancy between the targeted and actual users of current analytics frameworks. Most systems are designed for the challenges of the Googles and Facebooks of the world— processing petabytes of data distributed across large cloud deployments consisting of thousands of cheap commodity machines. Yet, the vast majority of users analyze relatively small datasets of up to sev...


Big Data: A Small Introduction

Soho, London, August 1854: seven years before the discovery of germs by Louis Pasteur, people were dying in their hundreds from a mysterious disease called cholera. The wisdom of the time was that cholera was caused by miasma: something bad in the air—a foul fog of disease that would naturally build up in heavily populated areas. But John Snow, a physician working in London, was sceptical of th...



Journal

Journal title: Omega

Year: 2021

ISSN: 1873-5274, 0305-0483

DOI: https://doi.org/10.1016/j.omega.2021.102419